JIGSAW: integration of multiple sources of evidence for gene prediction

Authors

  • Jonathan E. Allen
  • Steven Salzberg
Abstract

MOTIVATION: Computational gene finding systems play an important role in finding new human genes, although no systems are yet accurate enough to predict all or even most protein-coding regions perfectly. Ab initio programs can be augmented by evidence such as expression data or protein sequence homology, which improves their performance. The amount of such evidence continues to grow, but computational methods continue to have difficulty predicting genes when the evidence is conflicting or incomplete. Genome annotation pipelines collect a variety of types of evidence about gene structure and synthesize the results, which can then be refined further through manual, expert curation of gene models.

RESULTS: JIGSAW is a new gene finding system designed to automate the process of predicting gene structure from multiple sources of evidence, with results that often match the performance of human curators. JIGSAW computes the relative weight of different lines of evidence using statistics generated from a training set, and then combines the evidence using dynamic programming. Our results show that JIGSAW's performance is superior to ab initio gene finding methods and to other pipelines such as Ensembl. Even without evidence from alignment to known genes, JIGSAW can substantially improve gene prediction accuracy as compared with existing methods.

AVAILABILITY: JIGSAW is available as an open source software package at http://cbcb.umd.edu/software/jigsaw.
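
The general idea of weighting evidence sources from a training set and then combining them by dynamic programming can be illustrated with a toy sketch. This is not JIGSAW's actual model or code: the two-state labelling (`N` for non-coding, `C` for coding), the accuracy-based weights, the `switch_penalty` parameter, and the function names `train_weights` and `combine` are all assumptions made for illustration only.

```python
from typing import Dict, List, Sequence

# Toy label set (an assumption): "N" = non-coding, "C" = coding.
STATES = ("N", "C")


def train_weights(training_evidence: Dict[str, Sequence[str]],
                  truth: Sequence[str]) -> Dict[str, float]:
    """Weight each evidence source by its per-position accuracy on a training region."""
    weights = {}
    for source, labels in training_evidence.items():
        correct = sum(1 for predicted, actual in zip(labels, truth) if predicted == actual)
        weights[source] = correct / len(truth)
    return weights


def combine(evidence: Dict[str, Sequence[str]],
            weights: Dict[str, float],
            switch_penalty: float = 0.5) -> List[str]:
    """Viterbi-style dynamic program over positions.

    Each position scores a state by the summed weight of the sources voting
    for it; changing state between adjacent positions costs `switch_penalty`.
    """
    n = len(next(iter(evidence.values())))

    def vote(i: int, state: str) -> float:
        return sum(weights[src] for src, labels in evidence.items() if labels[i] == state)

    def step_cost(prev: str, cur: str) -> float:
        return switch_penalty if prev != cur else 0.0

    best = [{s: vote(0, s) for s in STATES}]  # best[i][s]: best score ending in s at i
    back = []                                 # back[i-1][s]: predecessor of s at i
    for i in range(1, n):
        row, ptr = {}, {}
        for s in STATES:
            prev = max(STATES, key=lambda p: best[-1][p] - step_cost(p, s))
            row[s] = best[-1][prev] - step_cost(prev, s) + vote(i, s)
            ptr[s] = prev
        best.append(row)
        back.append(ptr)

    # Trace back from the best final state.
    state = max(STATES, key=lambda s: best[-1][s])
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


# Hypothetical training region and three made-up evidence sources.
truth = "NNCCCCNN"
weights = train_weights(
    {"ab_initio": "NNCCCCNN", "protein_hit": "NCCCNCNN", "est": "CCCCCCCC"},
    truth,
)
prediction = combine(
    {"ab_initio": "NNNNN", "protein_hit": "NNCNN", "est": "NNCNN"},
    weights,
)
print(weights, "".join(prediction))
```

In this sketch the switch penalty lets the dynamic program overrule an isolated vote from the lower-weighted sources when a short flip would cost more than it gains, which is the kind of conflict resolution the abstract describes only at a high level.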

Similar resources

Jigsaw: A good student-centered method in medical education

Introduction: Today, student-centered methods must be used to train students with professional competency. One of the most valuable methods is Jigsaw (JT). Despite its various positive effects on students’ learning, not all teachers are familiar with Jigsaw. In order to familiarize teachers with this method and encourage them to use it in teaching their students, this article introduces Jigsaw,...

Language Skill-Task Corollary: The Effect of Decision-Making vs. Jigsaw Tasks on Developing EFL Learners’ Listening and Speaking Abilities

Task-based language Teaching (TBLT) has occupied the pertinent literature for some long years. However, the role of specific task type in developing specific skill type seems to be amongst the intact issues in the literature. To shed more light on this issue, the present study was conducted to compare the effect of jigsaw and decision-making tasks on improving listening and speaking abilities o...

The Impact of Skill Integration on Task Involvement Load

The present study investigated whether word learning and retention in a second language are contingent upon a task's involvement load, i.e., the amount of need, search, and evaluation the task imposes. Laufer and Hulstijn (2001) contend that tasks with higher degrees of these three components induce higher involvement load, and are, therefore, more effective for word learning. To test this clai...

Effect of Jigsaw Technique on the Education of Menstrual Self-care Behaviour to Female Adolescents

Background: It is essential to keep the reproductive organs and surrounding areas hygienic during menstruation to prevent health issues. Inadequate menstrual self-care knowledge, poor attitudes, and behavior among female adolescents can lead to increased morbidity and other complications among them, such as reproductive tract infections. Aim: This study a...

Noise tolerance of Multiple Classifier Systems in data integration-based gene function prediction

The availability of various high-throughput experimental and computational methods developed in the last decade allowed molecular biologists to investigate the functions of genes at system level opening unprecedented research opportunities. Despite the automated prediction of genes functions could be included in the most difficult problems in bioinformatics, several recently published works sho...

Journal:
  • Bioinformatics

Volume 21, Issue 18

Pages: -

Publication date: 2005